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Abstract In many online systems, individuals provide services for each other; the 
recipient of the service obtains a benefit but the provider of the service incurs a 
cost. If benefit exceeds cost, provision of the service increases social welfare and 
should therefore be encouraged - but the individuals providing the service gain 
no (immediate) benefit from providing the service and hence have an incentive to 
withhold service. Hence there is scope for designing a system that improves welfare 
by encouraging exchange. To operate successfully within the confines of the online 
environment, such a system should be distributed, practicable, and consistent with 
individual incentives. This paper proposes and analyzes a simple such system that 
relies on the exchange of tokens; the emphasis is on the design of a protocol (number 
of tokens and suggested strategies) . We provide estimates for the efficiency of such 
protocols and show that choosing the right protocol will lead to almost full efficiency 
if agents are sufficiently patient. However, choosing the wrong protocols may lead 
to an enormous loss of efficiency. 
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1 Introduction 



In many online systems, individuals provide services to one another. Napster 
- now defunct - in which individuals shared music files is surely the most 
familiar of the systems, but there are many systems and services in cur- 
rent operation. In peer-to-peer networks, individuals share files (Gnutella 
and Kazaa), or provide computational assistance (Seti@home) or provide 
content, evaluations and answers to questions (Slashdot, Yahoo Answers). 
More broadly, many entities (although not necessarily individuals) provide 
forwarding/routing services to others. In such systems, the recipient of the 
service obtains a benefit while the provider of the service incurs a cost, 
which may be quite substantial. Assuming that the benefit exceeds the 
cost, provision of the service increases social welfare and should therefore 
be encouraged, but the agents (individuals or entities) providing the service 
have an incentive to free ride. Empirical studies show that this free-riding 
problem can be quite severe: in the Gnutella system for instance, almost 
70% of users share no files at all (Adar & Hubeman, 2000). 

In such settings, there is a great deal of scope for a benevolent designer to 
improve welfare by careful design of a system that encourages exchange. The 
extent to which this potential can be realized depends on the environment 
in which the system operates and on the degree of control the designer can 
exert. Here we show that a great deal can be achieved by implementing 
a very simple and practicable system that relies solely on the exchange 
of intrinsically worthless tokens or fiat money^ We provide estimates for 
efficiency and an effective procedure for constructing a "good" system. If 
agents are patient or the benefit/cost ratio is high, efficiency can be made 
near 1; i.e., trade will take place in almost all interactions. 

Our work has a good deal in common with the literature on exchange in 
the absence of a "double coincidence of wants" and on "money as a medium 
of exchange", of which Ostroy & Starr (1971) and Kiyotaki & Wright (1989) 
are the seminal papers (see also especially Green &; Zhou (1998) and Zhou 
(1999)), but differs from that literature in important ways. To understand 
our work and its relationship to that literature, it is useful to begin by 

1 By "tokens" or "fiat money" we mean the same thing; we use the former term to be 
consistent with the engineering literature. 
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identifying important features of our environment that are common to all 
the systems mentioned above, and to understand the special restrictions 
these features impose. The first is that agents interact directly, rather than 
through a central authority^] No central authority means no central record- 
ing of transactions and of course no centralized punishments. The second 
is that the scale is large and interactions are anonymous, so that although 
agents may interact frequently, they do not interact repeatedly with the 
same agents, so that reciprocation must be indirect, rather than direct. The 
third is that, because individuals interact only online, they cannot observe 
much information about the others with whom they interact; in particular 
they cannot observe the histories of others. The fourth is that communica- 
tion in our environment is typically costly (either in direct costs or in terms 
of delay) or impossible. 

The features identified above lead to the first desideratum for our sys- 
tem: it should be completely distributed, in the sense that the behavior of 
an agent - in particular, the decision to request /provide service - should 
depend only on the agents' own history. The second desideratum is that the 
system be practicable. At the level of agents, practicability entails minimal 
exchange of messages. At the level of the designer, practicability entails that 
the designer initializes the system (in our work this means choosing a pro- 
tocol: providing tokens and recommending strategies), but thereafter takes 
no actions; in particular, the designer does not monitor the agents or the 
system (and so cannot limit the number of tokens an agent can hold). Be- 
cause it seems impossible for the designer to know the precise parameters of 
the environment (in our setting this means benefit/cost ratios and discount 
factors), practicability entails that the system should be robust to small per- 
turbations of these parameters. Finally, practicability entails simplicity. We 
admit that we cannot provide a working definition of "simplicity" but we 
hope the results and discussion that follow will make the meaning clearer. 

2 Napster and the other systems mentioned above, and many others, do not operate 
through a central authority. Napster, for instance, merely maintained many partial lists 
(distributed across many servers) of music files available and contact information for sub- 
scribers who had these files; users seeking files could simply search these lists and then 
contact the file-holder directly. For such systems, the absence of a central authority is 
necessary - a central authority could not handle the amount of traffic generated - and 
desirable - because a central authority would be vulnerable to attacks that might crash 
the entire system. 
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These features and desiderata have implications for what is possible in 
the environments we consider and what distinguishes our work from the lit- 
erature. If there were a central authority that could record all encounter, 
then deviations from recommended strategies could be easily detected and 
punished, but as we have noted above, in our environment there is no cen- 
tral authority or central record-keeping. The fact that agents do not meet 
in person means that agents cannot inspect the token holdings of others]^] 
This makes it impossible to use procedures such as those required in the 
"two-money theorem" of Kocherlakota (2002), which yields socially optimal 
outcomes, and in Berensten, Camera & Waller (2007), and other papers, in 
which the behavior of parties in a match requires that each party knows the 
holding of the counter-party. It also seems impossible to use procedures that 
require the parties to bargain because bargaining would require either that 
each party knows the holding (hence the outside option) of the counter-party 
or that the parties bargain without knowing the holding (hence the outside 
option) of the counter-party, which (because it would take place in a setting 
of incomplete information) would require a great deal of communication. 
As we show in Theorem [5j our desire that the design be robust rules out 
mixed strategies; similar arguments (discussed in the Conclusion) show that 
robustness also rules out random or fractional offers. Because packets are 
essentially indivisible, our environment rules out fractional exchanges, but 
they would be ruled out by robustness as well. Perhaps more subtly, the 
impossibility for the designer to monitor the holdings of agents rules out an 
exogenous upper bound on the number of tokens an agent can hold; contrast 
Camera & Corbae (1999), Berentsen (2000) and others]^] As we discuss in 
Section [3j the absence of an exogenous upper bound on token holdings causes 
surprising complications (proving that equilibrium exists becomes a difficult 
task) and surprising implications (the "optimal quantity of money" may be 

3 The reader might wonder how agents who do not meet in person can exchange tokens 
at all, since they can only exchange electronic files, and electronic files would seem to be 
easily duplicated. In fact, however, there are practicable, secure and private procedures 
for online token exchange, utilizing hardware or software or both; see Buttyan & Hubaux 
(2001), Vishnumurthy & Chandrakumar & Sirer (2003) and Ciuffoletti (2010). Similar 
procedures can also serve as escrow accounts to assure that service that is promised is 
actually provided and that payment that is promised is actually made. 

4 In a somewhat different vein, Cavalcanti & Wallace (1999A, 1999B) study environ- 
ments in which some agents (banks) can be monitored and can issue tokens. 
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quite different than it is when an exogenous upper bound is imposed). 

In the environment we consider, a continuum of agents each of whom 
possesses a unique file that can be duplicated and provided to others. (In 
the real systems we have in mind, the population is in the tens of thousands 
or hundreds of thousands, so a continuum model seems a reasonable ap- 
proximation.) In each period, a fraction of the population is matched; one 
member of each match (the client) must decide whether to request service 
(provision of a file or forwarding of a packet) and the other (the server) 
must decide whether to provide the service (if requested). The client who 
receives the service derives a benefit, the server who provides the service 
incurs a cost. To simplify the analysis we assume here that, except for the 
uniqueness of the files they possess, all agents are identical, and that all files 
are equally valuable to receive and equally costly to provide. (We discuss 
extensions in the Conclusion.) We assume the benefit exceeds the cost, so 
that social welfare is increased when the service is provided, but that the 
cost is strictly positive, so that the server has a disincentive to provide it. 

We allow the designer to determine a supply of tokens and to recom- 
mend strategies (circumstances under which service should be requested or 
provided); together the token supply and the recommended strategies con- 
stitute a protocol. We assume that the price of service is fixed at one token; 
as we have noted above and discuss further in the Conclusion, robustness 
rules out fractional and random offers. We require that the protocol should 
induce an equilibrium - the recommended strategies are best replies in the 
(unique) steady-state distribution - and remain so for small perturbations 
of the population parameters. There are always degenerate equilibrium pro- 
tocols in which no service is provided in the steady state. Aside from these 
degenerate protocols, we show that the requirements of equilibrium and 
of robustness has strong implications: all robust equilibrium protocols are 
Markov in private token holdings (not history dependent) and symmetric 
(the population plays a pure strategy) and have a particularly simple form: 
clients request service whenever their token holding is above zero; servers 
provide service when their token holding is below a certain threshold and 
do not provide service when the token holding is above that threshold]^] 

5 The fact that the designer recommends the same strategy to all agents is, in part, a 
consequence of our assumption that agents are identical. If agents were of several types 
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However, these conclusions do not entirely solve the designer's prob- 
lem, because he must choose the right threshold and the right supply of 
tokens/money. If the threshold is too large, agents will not be willing to 
continue providing service and accumulating tokens up to the threshold; if 
the threshold is too small, agents will not be willing to stop providing ser- 
vice and accumulating tokens when they reach the threshold; both of these 
violate the equilibrium incentive conditions. If the supply of tokens is too 
small many agents will have no tokens and hence will be unable to pur- 
chase service; if the supply of tokens is too large, many agents will be at 
the threshold and hence will be unwilling to provide service; both of these 
degrade social welfare. Among all protocols with threshold K, there is a 
unique protocol, in which the supply of tokens is K/2, that would be best if 
agents always complied with the protocol - but this protocol need not be an 
equilibrium. Given the parameters of the population (benefit/cost ratio and 
discount factor) there is always at least one threshold - and at most two - 
for which this protocol is a robust equilibrium; and we provide estimates for 
the parameter ranges for which a particular protocol of this form is a robust 
equilibrium and of the efficiency of such robust equilibrium protocols. A 
consequence of these estimates is that, as the discount factor tends to 1 or 
the benefit /cost ratio tends to oo, the efficiency of such equilibrium proto- 
cols becomes arbitrarily close to first-best (trade always occurs). However, 
these protocols are not always the optimal equilibrium protocols: it may be 
more efficient for the designer to choose a higher threshold but to supply 
fewer tokens. In macro-economic language: K/2 need not be the optimal 
quantity of money. 

At the risk of redundancy, we would like to emphasize that the environ- 
ment we study and the questions we ask are different from those in '"stan- 
dard" monetary theory, and that the differences present new challenges, to 
which this paper responds. We believe this paper generates a number of 
insights; among these are: 

• the existence of equilibrium protocols is a delicate problem, but all 

- differing perhaps in the types of files they provide, their costs, the benefits they derive 
from various types of files, and even the frequency with which they request/provide service 

- the designer would recommend the same strategy to all agents of a particular type, and 
equilibrium would again be symmetric. 
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equilibrium protocols are quite simple; 

• the requirement that the system be practicable, and in particular that 
equilibrium protocols be robust to small perturbations in the popula- 
tion parameters, has significant impact on the nature of equilibrium; 

• the most efficient robust equilibrium protocols do not necessarily pro- 
vide half as many tokens as the selling threshold; 

• in order that efficiency close to 1 be achievable, it is necessary that 
either the benefit/cost ratio or the discount factor be high and that 
both the threshold and the supply of tokens be large; 

• efficiency can be attained in the limit when the benefit/cost ratio tends 
to oo or the discount factor tends to 1 

• explicit lower bounds on efficiency can be provided when the bene- 
fit/cost ratio is finite and the discount factor is not close to 1; 

• there is an effective procedure for constructing 'good' robust equilib- 
rium protocols. 

Following a further discussion of the literature below, the remainder of 
the paper is organized in the following way. Section 2 introduces the model, 
defines strategies, the steady state value function, best responses, equilib- 
rium and robust equilibrium. Section 3 describes the nature of equilibrium 
and robust equilibrium. Section 4 discusses existence of robust equilib- 
rium. Section 5 discusses efficiency of equilibrium protocols and shows that 
asymptotic efficiency can be obtained when agents become infinitely patient 
or the benefit/cost ratio increases without bound. However, the protocols 
we identify as asymptotically efficient are not necessarily the most efficient 
protocols (for fixed population parameters). Section 6 illustrates by a sim- 
ulation how big the efficiency loss from choosing the wrong protocol can be 
and proves that efficiency requires a large supply of tokens. Section 7 con- 
cludes and offers some directions for further research. Proofs are collected 
in the Appendix. 
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Other Literature 

We have discussed (some of) the literature on "money as a medium of 
exchange" . A somewhat different literature, on "money as memory" (see es- 
pecially Kocherlakota (1998) and Wallace (2010)) studies the extent to which 
the role of money can or cannot be served by record- keeping; i.e., when is 
money required? In some sense, the purpose of that literature is nearly the 
opposite of our purpose here. Record-keeping requires a great deal of cen- 
tralization; the use of tokens makes possible complete decentralization. An 
important issue for us is the welfare cost of this complete decentralization, 
which is discussed in Section 6. 

This work also connects to an Electrical Engineering and Computer 
Science literature that discusses token exchanges in online communities. 
Some of that literature assumes that agents are compliant, rather than self- 
interested, and does not treat incentives and equilibrium (Vishnumurthy, 
Chandrakumar & Sirer 2003), (Buttyan & Hubaux 2003); some of that lit- 
erature makes use of very different models than the one offered here (Tan 
& Jarvis 2006) and (Figueiredo, Shapiro & Towsley 2004); and some of the 
literature is not formal and rigorous, offering simulations rather than theo- 
rems (Pai & Mohr 2006). The papers closest to ours are probably Friedman, 
Halpern and Kash (2006, 2007), which treat somewhat different models. 
However, these papers seem puzzling in many dimensions and many of the 
proofs seem mysterious (at least to us). 

Another literature to which this work connects is the game-theoretic 
literature on anonymous interactions. In a context in which interactions 
were publicly observable, full cooperation (i.e., provision of service) could be 
achieved at equilibrium by the use of trigger strategies, which deny service in 
the future to any agent who refuses service in the present. As Kandori (1992) 
and Ellison (2000) have pointed out, in some contexts, cooperation can be 
supported even without public observability if agents deny service in the 
future to all agents whenever they have observed an agent who refuses service 
in the present; in this equilibrium any failure to provide service results in 
a contagion, producing wider and wider ripples of defection, until no agent 
provides service. However contagion is not likely to sustain cooperation in 
the systems of interest to us, because the population is so large (typically 
comprising tens of thousands or even hundreds of thousands of agents) that 
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an agent is unlikely, in a reasonable time frame, to meet any other agent 
whose network of past associations overlap with his. (When the population 
is literally a continuum, no agent ever meets any other agent whose network 
of past associations overlap with his.) 

A more relevant literature, of which Kandori (1992) is again the seminal 
work, uses reputation and social norms as devices as a means of incentiviz- 
ing cooperation. The work that is closest to ours is Zhang, Park & van der 
Schaar (2010), which asks which reputation-based systems can be supported 
in equilibrium and which of these achieve the greatest social efficiency Be- 
cause provision of service in their model depends on the reputations of both 
client and server, some central authority must keep track of and verify rep- 
utations; hence these systems are not distributed in the sense we use here. 
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2 Model 



The population consists of a continuum (mass 1) of infinitely lived agents. 
Each agent can provide a resource (e.g, a data file, audio file, video file, 
service) that is of benefit to others but is costly to produce (uploading a 
file uses bandwidth and time). The benefit of receiving this resource is b 
and the cost of producing it is c; we assume b > c > 00 Agents care about 
current and future benefits/costs and discount future benefits/costs at the 
constant rate /3 £ (0,1). Agents are risk neutral so seek to maximize the 
discounted present value of a stream of benefits and costs. 

Time is discrete. In each time period, a fraction p < 1/2 of the popula- 
tion is randomly chosen to be a client and matched with a randomly chosen 
server; the fraction 1 — 2p are unmatched^] (No agent is both a client and 
a server in the same period.) When a client and server are matched, the 
client chooses whether or not to request service, the server chooses whether 
or not to provide service (e.g., transfer the file) if requested. 

The parameters b, c, /3, p completely describe the environment. Because 
the units of benefit b and cost c are arbitrary (and tokens have no intrinsic 
value), only the benefit-cost ratio r = b/c is actually relevant. We consider 
variations in the benefit-cost ratio r and the discount factor f3, but view the 
matching rate p as immutable. 

2.1 Tokens and Strategies 

In a single interaction between a server and a client, the server has no 
incentive to provide services to the client. The mechanism we study for 
creating incentives to provide service involves the exchange of tokens. Tokens 
are indivisible, have no intrinsic value, cannot be counterfeited, and can 
be stored and transferred without loss. Each agent can hold an arbitrary 
non-negative finite number of tokens, but cannot hold a negative number 

6 If b < c there is no social value to providing service; if c < agents will always be 
willing to provide service. 

7 We assume that the matching procedure is such that the Law of Large Numbers holds 
exactly; Duffie & Sun (1997), Als-Ferrer (1999) and Podczek & Puzzello (forthcoming) 
construct such matching procedures. 
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of tokens and cannot borrow. We emphasize that our tokens are purely 
electronic objects and are transferred electronically. 

The designer creates incentives for the agents to provide or share re- 
sources by providing a supply of tokens and recommending strategies (be- 
havior) for agents when they are clients and servers. At the moment, we 
allow for strategies that depend on histories but we show that optimal strate- 
gies (best responses) depend only on current token holdings. 

An event describes the particulars of a match at a particular time: 
whether the agent was chosen to be a client or a server or neither, whether 
the agent was matched with someone who was willing to serve or to buy, 
whether the agent received a benefit and surrendered a token or provided 
service and acquired a token or neither, and the change in the token hold- 
ing. Write et for an event at time t. A history of length T specifies an initial 
token holding m and a finite sequence of events h = (m; €q, e±, e^-i)- Write 
Ht for the set of histories of length T, H = (J T Ht for the set of finite his- 
tories. An infinite history specifies an initial token holding m and an infinite 
sequence of events h = (m; eo, e±, . . .). We insist that finite/infinite histories 
be feasible in the sense that net token holdings are never negative (i.e., a 
request for service by an agent holding tokens will not be honored) . Given 
a finite or infinite history h, write d(h, t) for the change in token holding 
at time t and d + (h,t),d~ (h,t) for the positive and negative parts of d(h,t). 
Note that d(h,t) = +1 if the agent serves, d(h,t) = —1 if the agent buys, 
d(h, t) = otherwise. Note also that the token holding at the end of the 
finite history h is 

T-l 

N(h) = m+^2d(h,t) 
t=o 

A strategy is a pair (er, r) : H — > {0, 1}; r is the client strategy and a 
is the server strategy. Following the history h, r{h) = 1 means the client 
requests service and r(h) = means the client does not request service; 
cr(h) = 1 means the server provides service, o~(h) = means the server 
does not provide service. (Note that we require individual agents to follow 
pure strategies, but we will eventually allow for the possibility that different 
agents follow different pure strategies, so the population strategy might be 
mixed.) If service is requested and provided, a single token is transferred 
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from client to server, so the client's holding of tokens decreases by 1 and the 
server's holding of tokens increases by 1. Tacitly, we assume that a token is 
transferred if and only if service is provided; like the transfer of tokens itself, 
this can be accomplished electronically in a completely distributed way. 

2.2 Steady State Payoffs, Values and Optimal Strategies 

Because we consider a continuum population, assume that agents are matched 
randomly and can observe only their own histories, the relevant state of the 
system from the point of view of a single agent can be completely summa- 
rized by the fraction fi of agents who do not request service when they are 
clients and the fraction v of agents who do not provide service when they 
are servers. If the population is in a steady state then fi, v do not change 
over time. Given fi, v, a strategy (r, a) determines in the obvious way a 
probability distribution P(t, a\fi, v) over infinite histories H. We define the 
discounted expected utility to an agent whose initial token holding is m and 
who follows the strategy (r, a) to be 

oo 

Eu(m, t, a\n, v) = ^ P(t, a\fi, v){h) ^ (3 f [d + (h, t)b - d~(h, t)c] 
heH t=o 

(Here and below, when some of the variables (3, b, c, fi, v, r, a are clearly un- 
derstood we frequently omit all or some of them; this should not cause 
confusion.) 

Given fi, v, r, a and an initial token holding m we define the value to be 
V(m, fi, v, t, a) = sup Eu(m, r, cr\fi, u) 

(t,*) 

Discounting implies that the supremum - which is taken over all strategy 
profiles - exists and is at most 6/(1 — /3). 

Given fi, v the strategy (r, a) is optimal or a best response for an initial 
token holding of m if 

Eu{m, t, a\fi, v) > Eu(m, t , a'\fi, v) 

for all alternative strategies t' ,o~' . Because agents discount the future at the 
constant rate i3, the strategy (r, a) is optimal if and only it has the one- 
shot deviation property; that is, there does not exist a finite history h and 
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a profitable deviation (r' , <t') that differs from (r, cr) following the history h 
and nowhere else. A familiar and straightforward diagonalization argument 
establishes that optimal strategies exist and achieve the value; we record 
this fact below, omitting the proof. 

Proposition 1 For each fi, v and each initial token holding m there is an 
optimal strategy r, a and 

Eu(m, r, a\n, v) = V(m, fi, u, r, a) 
2.3 Optimal Strategies 

We want to characterize optimal strategies, but before we do, there is a 
degeneracy that must be addressed. If fi = 1 then no one ever requests 
service so the choice of whether to provide service is irrelevant; if v = 1 then 
no one ever provides service so the choice of whether to request service is 
irrelevant. In what follows, we sometimes ignore or avoid these degenerate 
cases, but this should not lead to any confusion. 

Fix /3, b, c, //, v\ let (r, a) be optimal for the initial token holding m. Note 
that the continuation of (r, a) must also be optimal following every history 
that begins with m. If h is such a history and the token holding at h is n 
then (t, a) induces a strategy (r h ,o- h ) from an initial token holding n that 
simply transposes what follows h back to time 0, and this strategy must be 
optimal for the initial token holding of n. Conversely, any strategy that is 
optimal for the initial token holding of n must also be optimal following h. 
It follows that optimal strategies (t, a) (whose existence is guaranteed by 
Proposition 1) depend only on the current token holding but are otherwise 
independent of history; we frequently say such strategies are Markov - but 
note that they are Markov in individual token holdings. Write v, f3) for 
the set of optimal strategies. 

Theorem 1 For all b, c, /3, fi, v with v < 1, every optimal strategy (t, cr) has 
the property that r(n) = 1 for every n > 1; i.e. "always request service when 
possible"^ 

8 Because a request for service will not be honored when an agent holds tokens, it is 
irrelevant whether r(0) = or r(0) = 1. 
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In view of Theorem [TJ we suppress client strategies r entirely, assuming 
that clients always request service whenever possible. We abuse notation 
and continue to write v, /3) for the set of optimal strategies. 

We now show that optimal (server) strategies also have a simple form. 
Say that the (server) strategy a is a threshold strategy (with threshold K) if 

a(n) = 1 if n < K 

a(n) = if n > K (1) 
We write o~k for the threshold strategy with threshold K and 

S = {a K : < K < oo} 
for the set of threshold strategies. 

Theorem 2 For each (j,,u,b,c, /3 with /i < 1 the set of optimal (server) 
strategies consists of either a single threshold strategy or two threshold strate- 
gies with adjacent thresholds. 

(The assumptions in Theorems [I] and [2] that v < 1 and /i < 1 avoid the 
degeneracies previously noted.) 

2.4 Protocols 

The designer chooses a per capita supply of tokens a G (0, oo) and recom- 
mends a strategy to each agent; we allow for the possibility that the designer 
recommends different strategies to different agents. Because self-interested 
agents will always play a best response, the designer will recommend only 
strategies in E; in view of anonymity, it does not matter which agents are 
recommended to play each strategy, but rather only the fraction of agents 
recommended to play each strategy. Hence we can identify a recommenda- 
tion with a mixed threshold strategy, which is a probability distribution on S; 
with the obvious abuse of notation, we view 7 as a function 7 : N+ —> [0, 1] 
such that 

l{K) > for each K > 

00 

= 1 

K=0 
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Write A(E) for the set of mixed threshold strategies. As usual, we identify 
the threshold strategy ok with the mixed strategy that puts mass 1 on ok- 
Assuming that the designer only recommends best responses (because other 
recommendations would not be followed), we interpret an element 7 G A(X) 
as a recommendation that the fraction j(K) play the threshold strategy ax- 

A protocol is a pair II = (a, 7) consisting of a per-capita supply of tokens 
a G (0, 00) and a mixed strategy recommendation 7 G A(£). 

2.5 Invariant Distributions 

If the designer chooses the protocol II = (a, 7) and agents follow the recom- 
mendation 7, we can easily describe the evolution of the token distribution 
(the distribution of token holdings). Note that the token distribution must 
satisfy the two feasibility conditions: 

00 

= 1 (2) 

fc=0 

00 

= a (3) 

k=0 

Write 

ii = n(o), v= ^2 

cr(fc)=0 

Evidently, \i is the fraction of agents who have no tokens, hence cannot pay 
for service, and v is the fraction of agents who do not serve (assuming they 
follow the protocol). 

To determine the token distribution next period, it is convenient to think 
backwards and ask how an agent could come to have k tokens in the next 
period. There are three possibilities; the agent could have 

• k — 1 tokens in the current period, be chosen as a server, meet a client 
who can pay for service, and provide service (hence acquire a token); 

• k + 1 tokens in the current period, be chosen as a client, meet a server 
who provides service, and buy service (hence expend a token); 
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• k tokens in the current period but and neither provide service nor buy 
service (hence neither acquire nor expend a token). 

Given a recommendation 7 it is convenient to define cr 7 : N+ — > [0, 1] by 

00 

°" 7 ( n ) = r y( K ) a K{ n ) 

K=0 

Note that cr 7 (ra) is the fraction of agents in the population who serve when 
they have n tokens, assuming that agents follow the recommendation 7 and 
that the Law of Large Numbers holds exactly in our continuum framework, 
so <t 7 is the population strategy. Keeping in mind that token holdings cannot 
be negative, it is easy to see that the token distribution next period will be 

V+ (k) = v (k-l)[p(l-p)a'(k-l)} 
+ v (k + l)[p(l-u)} 
+ 11 (k)[l-p{l-p)a<{k)-p(l-u)] (4) 

where we use the convention 77 (— 1) = 0. 

Given the protocol II = (0,7), the (feasible) token distribution 77 is 
invariant if 77+ = 77; that is, n is stationary when agents comply with the 
recommendation 7. Invariant distributions always exist and are unique. 

Theorem 3 For each protocol U = (a, 7) there is a unique invariant dis- 
tribution 7/ n , which is completely determined by the feasibility conditions |^|) 
and and the recursion relationship 

rF(k) = V u (k-l)[p(l-pW(k-l)\ 
+ r r n (k + l)[p(l-v)] 
+ rf n {k)[l-p{l- l JL)o-~<{k)-p{l-v)] (5) 

2.6 Definition of Equilibrium and Robust Equilibrium 

Assuming agents are rational and self-interested, they will comply with a 
given protocol if and only if compliance is individually optimal; that is, no 
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agent can benefit by deviating from the protocol. To formalize this, fix a 
protocol II = (a, 7), and let rf 1 be the unique invariant distribution. Write 

M n = ^(o) , - n = E v u (k) 

cr(fc)=0 

for the fraction of agents who have no tokens and the fraction of agents who 
do not serve (in the invariant distribution induced by n), respectively. We 
say II = (a, 7) is an equilibrium protocol if uk is an optimal strategy (given 
given /j, u ,r] U ) whenever j(K) > 0. That is, 7 puts positive weight only on 
threshold strategies that are optimal, given the invariant distribution that 
II itself induces. 

Using the one step deviation principle, we can provide a useful alterna- 
tive description of equilibrium in terms of the value function V. As noted 
before, because optimal strategies exist and are Markov, we may unambigu- 
ously write V k for the value following any history at which the agent has 
k tokens. (The value function depends on the population data p, v and on 
the environmental parameters b, c, /3; but there should be no confusion in 
suppressing those here.) 

Fix any Markov strategy a. In order for a to be optimal, it is necessary 
and sufficient that it achieves the value Ve following every token holding i. 
Expressed in terms of current token holdings and future values, and taking 
into account how behavior in a given period affects the token holding in 
the next period, this means that a is optimal if and only if it satisfies the 
following system of equations: 

V = pa(0)[(l-^)(-c + /3Vi) + ^/3Vb] 
+p[l - a(0)]pV o + (1 - 2p)m 



V k = P [{\-u){b + pV k -i) + vpV k ] 

+ pa(k)[(l - M )(-c + 0V k+1 ) + nPV k ] 
+p[l - a(k)]pV k + (1 - 2p)pV k 
for each k > (6) 

Applying this observation to the threshold strategy ok and carrying out the 
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requisite algebra, we conclude that gk is optimal if and only if 



-c + /3Vfe +1 > pV k if k<K 
-c + f3V k +i < PV k ifk>K 



(7) 
(8) 



(If it seems strange that a, 7 do not appear in these inequalities, remem- 
ber that the value depends on the invariant distribution rj n , which in turn 
depends on a and on 7.) 

Given a benefit /cost ratio r > 1 and a discount factor f3 < 1, write 
EQ(r, (3) for the set of protocols II that constitute an equilibrium when 
the benefit /cost ratio is r and the discount factor is (3. Conversely, given 
a protocol II write E(U) for the set {(r, (3)} of pairs of benefit/cost ratios 
r and discount factors (3 such that II is an equilibrium protocol when the 
benefit /cost ratio is r and discount factor is (3. Note that EQ,E are corre- 
spondences (which might have empty values) and are inverse to each other. 

Given r, f3 we say that II is a robust equilibrium if (r, (3) belongs to the 
interior of E(T1); i.e., there is some e > such that II G EQ(r' , f3') whenever 
\r'— r\ < e and \/3'— (3\ < e. Write EQR(r, (3) for the set of robust equilibrium 
protocols for the benefit/cost ratio r and discount factor (3 and ERiJl) 
for the set {(r, (3)} of pairs of benefit /cost ratios for which II is a robust 
equilibrium. Note that EQR, ER are correspondences (which might have 
empty values) and are inverse to each other. 
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3 Equilibrium and Robust Equilibrium 



We first describe the nature of equilibrium and robust equilibrium and then 
use that description to show that robust equilibria exist. The crucial fact 
about equilibrium is that the strategy part of an equilibrium protocol can 
involve mixing over at most two (thresholds and that these thresholds must 
be adjacent; the crucial fact about robust equilibrium is that the strategy 
cannot involve strict mixing at all but must rather be a pure strategy. 

Theorem 4 For each benefit/ cost ratio r > 1 and discount factor f3 < 1 the 
set EQ(r, (3) is either empty or consists of protocols that involve only (possi- 
bly degenerate) mixtures of two threshold strategies with adjacent thresholds. 

Theorem 5 If 11= (a, a) is a robust equilibrium then a is a pure threshold 
strategy. 

The existence of equilibrium or robust equilibrium does not seem at all 
obvious (and our proof is not simple). For both intuition and technical 
convenience, it is convenient to work "backwards": rather than beginning 
with population parameters r, f3 and looking for protocols II that constitute 
an equilibrium for those parameters, we begin with a protocol II and look 
for population parameters r, (3 for which II constitutes an equilibrium. That 
is, we do not study the correspondences EQ(r, (3) and EQR(r, (3) directly, 
but rather the inverse correspondences E{H) and ER{H). (This is easier for 
several reasons, one of which is that the latter correspondences are always 
non-empty.) 

To give an intuitive understanding of the difficulty and how we overcome 
it, fix a protocol II = (a, a) and let ?? n be the invariant distribution. Because 
we will eventually want to find a robust equilibrium, we assume a is a 
threshold strategy: a = ok- To look for population parameters r, (3 for 
which II is an equilibrium, let us fix r and let (3 vary. (Alternatively, we 
could fix f3 and let r vary, or vary both {3, r simultaneously, but the intuition 
is most easily conveyed by fixing r and letting f3 vary.) As we have already 
noted, the invariant distribution rj u , and hence /u n , v n , depends only on II. 
If (3 is close to 0, an agent will have little incentive to acquire tokens, but 
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the incentive to acquire tokens increases as ft — > 1. It can be shown that 
there is a smallest discount factor /3l(I1) with the property that an agent 
whose discount factor is at least will be willing to continue providing 

service until he has acquired K tokens. This is not enough, because gk will 
only be incentive compatible if the agent is also willing to stop providing 
service after he has acquired K tokens. However, it can also be shown that 
there is a largest discount factor /3#(n) for which the agent is willing to 
stop providing service after he has acquired K tokens, and that < 
(Recall that r, II are fixed.) For every discount factor f3 in the 
closed interval [/?l(II), /3#(n)], the protocol II is an equilibrium when the 
population parameters are r, (3; that is, (r, /3) £ E(U). From this it can 
be shown that for every discount factor /3 in the interval (/J/, (II), /3#(n)), 
the protocol II is a robust equilibrium when the population parameters are 
r, /3; that is, (r, /3) £ ER(U). Similarly, we can hold /3 fixed and let r 
vary from 1 to oo, construct the corresponding intervals [rj,(II), r#(II)] with 
rx(II) < rff(n) and then show that for every benefit/cost ratio r in the 
open interval (r 1,(11), r#(n)) the protocol II is a robust equilibrium when 
the population parameters are r, /3; that is, (r, (3) £ ER(TT). This is the 
content of Theorem |6] below. 

Applying this procedure for every protocol yields a family {ER(T1)} of 
non-empty open sets of parameters r,f3 for which robust equilibria exist. 
However our work is not done; we would also like to know that {ER(U)} 
covers a big enough set of population parameters. For instance, we want to 
know that for each r > 1 there is a j3* < 1 such that {ER(U)} covers the 
set n(r, 0) : f3 > f3*}; this would mean that for each r > 1 and (3 > f3* there 
is a protocol n that constitutes a robust equilibrium for these population 
parameters. Proving that this is so is not at all easy, and requires deriving 
some special properties of protocols of the form Tlx = (K/2, ctk)', this is the 
content of Theorem [7| below. 

It is natural to ask why our proof seems (and is) so much more compli- 
cated than existence proofs in the literature, such as in Berentsen (2000). 
The answer is that the literature assumes that there is an exogenous upper 
limit K* on the number of tokens any agent can hold. As discussed above, 
this assumption makes it relatively easy to show that equilibrium exists: 
Fix the benefit/cost ratio fix r > 1 and an arbitrary a > and consider 
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the protocol (a, ax*)- As above, an agent whose discount factor ft is at 
least /3l(&,o~k*) will provide service until he has acquired K* tokens, and 
will stop providing service after he has acquired K* tokens because, by as- 
sumption, he cannot hold more than K* tokens. Hence (r, f3) 6 E(a,crK*) 
for every j3 > /3l(iI) and (r, (3) £ ER(a, ax*) for every (3 > (3i,(a, ok*)- 
As we have noted in the Introduction, assuming an exogenous upper bound 
on token holdings does not seem realistic in the environments we consider. 
However, even if a large exogenous upper bound K* were imposed, our ap- 
proach would yield much more, because we can construct robust equilibria 
for discount factors much smaller than /3i(a, o~k*)- 

Theorem 6 Fix a protocol H = (a, o~k)- 

(i) For each benefit/cost ratio r > 1, the set {(3 : n G EQ(r,{3)} is a non- 
degenerate closed interval [/?i(n), /3fl-(n)] whose endpoints are contin- 
uous functions ofr. 

(ii) For each discount factor (3 < 1, the set {r : II £ EQ(r,f3)} is a non- 
degenerate closed interval [77 (II), r#(n)] whose endpoints are continu- 
ous functions of (3. 

These results are illustrated for a = 1/4 in in Figures [T] and [2| (Figure [T] 
may give the impression that the intervals for successive values of K do not 
overlap, but as Figure [2] illustrates, they actually do overlap; the overlap is 
masked by the granularity of the Figure. However, as we have already said, 
we do not assert that overlapping of intervals for successive values of K is a 
general property.) 

For the special protocols Uk = (K/2,ax), in which the supply of to- 
kens is exactly half the selling threshold, we can say a great deal more. In 
particular, the intervals corresponding to successive values of the threshold 
overlap but are not nested. As suggested above, this is exactly what we 
need to guarantee that equilibria exist (for sufficiently large values of the 
parameters r, f3). 
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Theorem 7 

(i) For each fixed benefit-cost ratio r > 1, successive f3-intervals overlap 
but are not nested: 

p L (n K -i) < Pl(Pk) < M n *-i) < PhQLk) 

for every threshold K . Moreover 

lim h(U K ) = 1 

K— >oo 

In particular, there is some /3* < 1 such that EQR{r, (3) ^ for all 
P > P*. 

(ii) For each fixed discount factor (3 < 1, successive r-intervals overlap but 
are not nested: 

r L (n x _i) < r L (U K ) < r H (PK-l) < th^-k) 

for every threshold K . Moreover 

lim tlQ^k) = oo 

K— >oo 

In particular, there is some r* > 1 such that EQR(r, p) ^ for all 

It follows from Theorem [7] that, as K — > oo, the left-hand end-points 
Pl(J1k) — > 1) so a fortiori the lengths of /3-intervals shrink to 0. It is 
natural to guess that the lengths of these intervals shrink monotonically to 
0, and simulations suggest that this guess is correct, but we have neither 
a proof nor a good intuition that this is actually true. We also guess that 
the lengths of r-intervals shrink monotonically, but again we have neither a 
proof nor a good intuition that this is actually true. 
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Figure 1: Pure and Mixed Equilibrium: a = 1/4 

(blue (thick) - pure equlibrium; red (thin) - mixed equilibrium) 
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Figure 2: Threshold Equilibrium: a = 1/4 
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4 Efficiency 



If agents were compliant (rather than self-interested), the designer could 
simply instruct them to provide service at every meeting and they would 
comply, so the per capita social gain in each period would be p(b — c). 
If agents follow the protocol II = (a, o~k) then service will be provided 
only in those meetings where the client can buy service and the server is 
willing to provide service, so the per capita social gain in each period will 
be p(b — c)(l — /u n )(l — z/ n ). Hence we define the efficiency of the protocol 
II to be 

Eff(n) = (l-^ n )(l- I , n ) 

In general it seems hard to determine the efficiency of a given protocol or 
to compare the efficiency of different protocols. However, we can provide 
efficiency bounds for protocols that utilize a given threshold strategy ok 
and compute the precise efficiency of the protocols n^j^] 

Theorem 8 For each a 6 (0,oo), each threshold K and all values of the 
population parameters we have: 

(i) ES(a,a K )<l-^ 

(ii) m(a,a K ) < Eff(n x ) 

(Hi) m(u K ) = (1 - jfa) 

Two implications of Theorem [8] are immediate. The first is that, in order 
that a (threshold) protocol achieve efficiency near 1 it is necessary that it 
provide a large number of tokens and also that it prescribe a high selling 
threshold. Put differently: to yield full efficiency in the limit it is not enough 
to increase the number of tokens without bound or to increase the threshold 
without bound - both must be increased without bound. The second is that 
the protocols ILk that provide K/2 tokens per capita are the most efficient 
protocols that utilize a given threshold strategy o~k- 

We caution the reader, however, that the protocols need not be equi- 
librium protocols, and it is (robust) equilibrium protocols that we seek. 

9 Berentsen (2000) derives similar results in a different model, with Poisson arrival rates. 
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However, it follows immediately from Theorem [7] that whenever agents are 
sufficiently patient or the benefit-cost ratio is sufficiently large (or both), 
then some protocol Hk is an equilibrium for large K, and hence that nearly 
efficient equilibrium protocols always exist. 

Theorem 9 

(i) for each fixed discount factor ft < 1 

liminf sup{Eff(n^) : U K G EQR(ft,r)} = 1 

r— >oo 

(ii) for each fixed benefit-cost ratio r > 1 

liminf sup{Eff(n^) : U K G EQR(ft,r)} = 1 

In words: as agents become arbitrarily patient or the benefit/cost ratio 
becomes arbitrarily large, it is possible to choose robust equilibrium protocols 
that achieve efficiency arbitrarily close to first best. However, as the sim- 
ulation illustrated in Figure [3] demonstrates, the protocols Hk need not be 
the most efficient equilibrium protocols; in this sense, K/2 is not the optimal 
quantity of money 10 

Some intuition might be useful. Consider the protocols Tlx and the 
corresponding invariant distributions. As K increases, the fraction of agents 
who cannot purchase service and the fraction of agents who will do not 
provide both decrease - so efficiency increases. However, if r, ft are fixed and 
K increases then the protocols Hk will eventually cease to be equilibrium 
protocols so equilibrium efficiency is bounded. On the other hand, if we fix 
r and let ft — > 1 or fix ft and let r — > oo then the thresholds K for which the 
protocols 11^ are equilibrium protocols blow up, and hence efficiency tends 
to 1. Put differently: high discount factors or high benefit/cost ratios make 
the use of high thresholds consistent with equilibrium. 

Theorem [9] provides asymptotic efficiency results; the following result 
presents an explicit lower bound (in terms of the population parameters 
r, ft) for the efficiency obtainable by a robust equilibrium protocol. 

10 Contrast Berentsen (2000), for instance, in which K/2 is the optimal quantity of 
money - but only under the assumption that K is an exogenous bound on token holdings. 
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Theorem 10 Given the benefit/cost ratio r > 1 and the discount factor 
P < 1, define 



ii 



K L = max i log P p ( — — ] — J . ( ) 

= log P/» 

x-P+pP \ 2r 



Then: 



(i) all the thresholds K for which II# is a robust equilibrium protocol lie 
in the interval [K L ,K H ]; 



(ii) the efficiency of the optimal robust equilibrium protocol is at least 

\ L KL+l ) - \KL+l ) ■ 

Theorem [To] yields a lower bound on efficiency because the optimal robust 
equilibrium protocol is at least as efficient as any protocol Hk that is a 
robust equilibrium, but does not yield an upper bound on efficiency because 
the optimal robust equilibrium protocol might be more efficient than any 
protocol ILfc that is a robust equilibrium. 



Theorem 10 also yields the designer an effective procedure for finding a 
robust equilibrium whose efficiency is good, if not optimal, since all that is 
necessary is to check protocols Hjc with thresholds K in the (finite) interval 
[K ,K H ]. Moreover, it is not necessary to conduct an exhaustive search. 
Rather the designer can begin by checking the protocol Hk, where K is the 
midpoint of the interval [K L ,K H \. If M aK {K - 1) > c//3 and M aK (K) < 
c/{3, then Uk is an equilibrium protocol and the search can stop. If M aK (K— 
1) < c/P, then for all K' > K, M OK ,{K' - 1) < c/0 (because P L {Ii K ,) > 
(3 l (T1k))- Therefore threshold protocols for which K' > K cannot be an 
equilibrium and the designer can restrict search to the left half interval 
[K L ,K]. If M aK {K) > c/P, then for all K' < K, M ax ,{K') > c/p (because 
P h (Hk') > P H (£Lk))- Therefore threshold protocols for which K' < K 
cannot be an equilibrium and the designer can restrict search to the right 
half interval [K, K H ] . Continuing to bisect in this way, the designer can find 
an equilibrium threshold protocol in at most log 2 (^^ — K L ) iterations. 



'Note that both the basis of the logarithms and the arguments are less than 1. 
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Figure 3: Optimal Equilibrium Protocols 

(red (thick) - IIr:; blue (thin) - other protocols) 
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Figure 4: Inefficient Equilibrium Protocols 

(red (thick) - II3; blue (thin) - optimal equilibrium protocols) 
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5 Choosing the Right Protocol 



The reader may wonder why we have put so much emphasis on choosing the 
right protocol. As Figure [3] already shows, the reason is simple: choosing 
the wrong protocol can result in an enormous efficiency loss. Figure |4j 
which compares efficiency of the most efficient protocol with efficiency of a 
protocol for which the strategic threshold is constrained to be K = 3, makes 
this point in an even starker way: as the reader will see, except for a small 
range of discount factors, the efficiency loss is enormous. 
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6 Conclusion 



In this paper, we have analyzed in some detail a simple, practicable and 
distributed method to incentivize trade in on-line environments through the 
use of (electronic) tokens. We have shown that when agents are patient, 
the method we offer can achieve outcomes that are nearly efficient, provided 
the right protocol (supply of tokens and recommended threshold) is chosen, 
but that equilibrium and efficiency are both sensitive to the precise choice 
of protocol. Surprisingly, the "optimal" supply of tokens need not be half 
the recommended threshold; this conclusion, and others, and much of the 
difficulty of our arguments are a consequence of our allowing agents to ac- 
cumulate as many tokens as they wish, rather than imposing an exogenous 
bound on token holdings (which is common in the literature). 

Our analysis is silent about convergence to the steady state. In partic- 
ular, we do not know whether the recommended strategies would lead to 
convergence to the invariant distribution for all initial token distributions 
or for some particular token distributions. Berentsen (2000) proves conver- 
gence under some conditions, but in a continuous time model in which token 
holdings are subject to an exogenous bound. We have already noted that 
the latter is a strong (and, in our view, unrealistic) asumption, but another 
point is worth making. By definition, the recommended strategy is a best 
reply when the system is in the steady state, but the recommended strategy 
need not be a best reply - and very likely is not a best reply - when the 
system is not in the steady state - so why should agents follow it? 

We have assumed that tokens are indivisible and must be exchanged 
for service one-for-one. Indivisibility seems an unavoidable consequence of 
the environment in which we are interested, but even if we were to allow 
for exchanges of fractional tokens, an equilibrium in which fractional tokens 
are exchanged cannot be robust, for the same reason that a mixed strategy 
equilibrium cannot be robust: any small perturbation of the population 
parameters will destroy the equilibrium. The effect of divisible money might 
be achieved by allowing for random offers and acceptances, as in Berentsen, 
Camera & Waller (2007), but again, equilibrium with random offers and 
acceptances could not be robust. 

An alternative approach would be to allow agents to offer or demand any 
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integral number of tokens, depending on their own holdings 12 However, if 
agents can demand multiple tokens then it would seem that no matter an 
agent's current holding, he would always be willing to serve if the price 
of service was sufficiently high. Conversely, it would seem that no matter 
the price of service, an agent would be willing to pay if her token holding 
were sufficiently high. Hence, an equilibrium protocol in which agents could 
demand multiple tokens would seem to induce a stationary distribution with 
unbounded support and require offers and acceptances to depend on current 
holdings. It is not clear to us that such an equilibrium protocol could exist; 
certainly it would not seem to be practicable]^] 

We have considered the simplest setting, in which agents are identical, 
all files are equally valuable, and no errors occur. In a more realistic setting, 
we would need to take account of heterogeneous agents and files and allow 
for the possibilities of errors (in transmission of files or exchange of tokens 
or both). We have followed here the well-known adage "one has to start 
somewhere" - but we are keenly aware that there is much more work to be 
done. 



1 Much of the literature assumes that this is not possible, assuming instead that agents 
who "go looking" for trade can only carry a single token, no matter their current stock of 
tokens. 

13 Invariant distributions with unbounded support are impossible if the number of agents 
is finite, no matter how large; in this case it would also seem hard to see how a model with 
a continuum of agents would be a good proxy for a world with a finite, but very large, 
number of agents. 
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Appendix: Proofs 



Proof of Theorem 1 We first estimate V(n + 1) — V{n) (for n > 0) 
which is the loss from having one less token. To this end fix an optimal 
Markov strategy (t, a). We define a history-dependent strategy (r',a') and 
estimate the expected utility to an agent who begins with n tokens and fol- 
lows (r', a'); this is a lower bound on V(n). The strategy (r', a') is most 
easily described in the following way: Begin by following the behavior pre- 
scribed by the strategy (a, r) but for an agent who holds one more token 
than is actually held; i.e., (r',a')(h) = (r,a)(N(h) + 1). If it never hap- 
pens that the agent holds tokens, requests service, and is matched with 
an agent who is willing to provide service, then continue in this way for- 
ever. If it does happen that the agent holds tokens, requests service, and 
is matched with an agent who is willing to provide service, then service is 
not provided in that period (because the agent cannot pay) and after that 
period (t',0 7 ) = (r, a). In other words, the agent behaves "as if" he held 
one more token than actually held until the first time such behavior results 
in requesting service, being offered service, and being unable to pay for ser- 
vice; after that point, revert to (r, a). The point to keep in mind is that if a 
moment of deviation occurs then an agent with one more token would hold 
exactly 1 token, would request and receive service, and in the next period 
would have tokens - so that reverting to (r, a) is possible. Beginning with 
n tokens and following the strategy (r', a') yields the same string of payoffs 
as beginning with n + 1 tokens and following the strategy (r, a) except in 
the single period in which deviation occurs; in that period the expected loss 
of utility is at most bp. Hence the expected utility from beginning with n 
tokens and following the strategy (r', a') yields utility at least V(n + 1) — bp. 
Hence V(n + 1) — V{n) <bp<b<b/(3. However, this is the incentive com- 
patibility condition that guarantees that an agent strictly prefers to request 
service when holding n + 1 tokens, so the proof is complete. ■ 

At this point it is convenient to collect some notation and isolate two 
technical results. Fix p, b, c, p, v and consider a Markov strategy a. For each 
k, let V a (k, (5) be the value of following a when the initial token holding is k 
and the discount factor is (3. As with the optimal value function V defined 
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in the text, the value function V a can be defined by a recursive system of 
equations: 

V a (0,(3) = pCT (0)[(l-/i)(-c + /3y CT (l,/3)) 

+p[l - (7(0)]/3^(0, ^) + (1 - 2p)/3^(0,/3) 



F CT (M) = p[(l-v)(b + (3V a (k-l,l3) + i;(3V a (k,/3)} 

+pa(k)[(l - n)(-c + pV a (k + 1, /3)) + pf3V a (k, /3)] 
+p[l - (7(fc)]/3F CT (fc,/3) + (1 - 2p)pV a {k,P) 
for fc > (9) 

From the value function, we define the marginal utilities 

M a (k,p) = V a {k + l,p)-V a (k,P) (10) 

If (3 is fixed/understood, we simplify notation by writing V a {k) = V a (k,f3) 
and M a {k) = M a {k,p). 

It is also convenient to introduce some auxiliary parameters: 

& = -(i-v) P p 

</> c = 1 - /3 + ((1 - u) + (1 - n)) P p 

4>r = -(l-fl)pP 



We note the signs of these parameters and various combinations: 

<Pl < , <t> c > , (f> r < 

4>i + 4>c + 4>r > o , 4>i + <t> c > o , </v + 4> c > o 



(11) 



(12) 



Using these auxiliary parameters and the recursion relations for V(a 
and performing some simple algebraic manipulations yields a useful matrix 
representation involving marginals that we will use frequently: 



V ••• 

4>l <f>r. 



M CT (0) 
M CT (1) 

M C (K X - 1) 



KxK 



(1 - v)pb 




(1 - p)pc 



(13) 
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In short form, write this matrix representation as 



$M = u (14) 

Lemma 1 Fix p, b, c, jx, v and (3. Let a be a Markov strategy with the prop- 
erty that 

f 1 if < k < K x 
a ^ ' ~ y if K 1 <k<K 2 

Then: 

(i) ifO<k< K 2 then M a {k) > 

(ii) in the range < k < K\, M a is either increasing, decreasing or de- 
creasing then increasing 

(Hi) ifM a {K x -l) > c/P thenM a (0) > M a (l) > ...M a {K\ — 2) > M a (K\ — 
1) > c/(3 

Proof We first consider the token holding levels < k < K\ . We make use 



of the matrix representation (13). 

To prove (i), we first show that M a (k) > for < k < K\. If K x < 3, 
this follows by simply solving the matrix representation, so we henceforward 
assume K\ > 3. If there exists a token holding level k* with < k* < K\ 
such that M a {k*) < then one of the following must hold: either (a) there 
two consecutive such token holding levels, or (b) the marginal payoffs of the 
neighboring token holding levels are both positive. We consider these cases 
separately. 

(a) In this case, there exists k* such that M a {k*),M a {k* + 1) are both 
non-positive. Of these marginals, one is at least as big; say M a (k*) > 
M a (k* + 1). From the identities above we see that 

+ = Wn + Wr + i) 



(& + <k)M g (fc* + l) 



< M a (k* + 1) 
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Proceeding inductively, it follows that > M a (k*) > M a (k* + 1)... > 
M a (K 1 -l). Moreover, 

<f> e M a (Ki-l) = (1- n)pc-<f> l M a (K 1 - 2) 
> -<f>iM{K-l) 



This requires (j) c < —<j>\ which contradicts the sign relations (12). The 
argument when M a (k* + 1) > M a (k*) is similar and is left for the reader. 

(b) In this case, there exists k* such that M a (k* — 1) > 0, M a (k*) < 0, 
M a (k* + 1) > 0. This entails 

4>iM a (k* - 1) + <f> c M a (k*) + (f> r M a (k* + 1) < (15) 



which again contradicts the sign relations (12). 

From the above we conclude M a (k) > for < k < K\. To see that 
M CT (0) > note that 

-<f> r M a (l) = <A c M CT (0) - (1 - v)pb < -<t> r M a (0) (16) 

Therefore, M CT (1) < M CT (0), so M CT (0) > 0, as desired. 

Finally, to see that M a (k) > for K\ < k < K2, apply the recursion 
equations ^ to obtain 

<PiM a (k-l) + (0 c + r )M a (k) = O (17) 



We know that M a (Ki— 1) > so the sign relations ( 12 ) imply that M a {K\) > 
as well. Now it follows inductively that M a (k) > for K\ < k < Ki- This 
completes the proof of (i) . 

To prove (ii) it is enough to show that M a has no local maximum for 
< k < K\. If M had a local maximum k* in this range we would have 
M a (k*) > M a (k* - 1) and M a (k*) > M a (k* + 1). However, algebraic 
manipulation yields the inequalities 

-cPlM a (k* - 1) - 4> r M a (k* + 1) 



M a (k*) 



< y ' Yr MJk*) 

< M a (k*) 
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which is a contradiction. This establishes (ii) 



To prove (iii), first manipulate the matrix identity (13) to obtain: 
(1 - v)pPM (J {K 1 - 2) 

= (1 - + ((1 - v) + (1 - n)) P P)M a {K - 1) - (1 - p,)pc (18) 
> (1 - p + (1 - u) P p)M a (K 1 - 1) > (1 - v)pPM a {K 1 ~ 1) 

In view of (ii), the marginal payoffs are decreasing, so this establishes (iii). ■ 



Lemma 2 Fix p, b, c and a threshold protocol H = (a, ax) with correspond- 
ing p n ,v n . The marginal utility M aK (k,P) is strictly increasing in the dis- 
count factor (3, i.e., ifO < Pi < fa < 1; then, 

M aK (k, Px) < M aK (k, p 2 ) for all k (19) 

Proof To economize slightly on notation we write a = o~k- We present the 
proof in three steps. 

In Step 1, we prove that if there exist < K\ < K<i < K — 1 such that 
VTc G [K\, K2], M a (k, Pi) > M a (k,P2), then at least one of the following is 
true, M a {Ki-l,Pi) > M a (Ki-l,p 2 ) or M ff (K 2 + l,pi) > M a (K 2 + 1, p 2 ). 

In Step 2, we prove that if there exists a k* G [0, K — 1] such that 
M a (k*,Pi) >M a {k*,P2), then for all k G [0,K - 1], M a {k,Pi) >M C7 (k,p 2 ). 
Step 2 uses the result of Step 1. 

In Step 3, we disprove the possibility that k G [0, K — 1], M a (k,Pi) > 
M a (k,p 2 ). 

Step 2 and Step 3 together show a contradiction and therefore, k G 
[0,K-1], M a (k,Pi) <M a (k,p 2 ). 

Step 1 We assert that if there are indices < Ki < K 2 < K — 1 such 
that M a (k,Pi) > M a (k,p 2 ) for all K x < k < K 2 then at least one of the 
following must hold: 

(A) M a (Kt - l,Px) > M a (K\ - l,p 2 ) 

(B) or M a (K 2 + l,pi) > M a (K 2 + l,/3 2 ). 
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To see this, note that simple manipulations of the matrix representation 
PJ) yield 



• if K 2 = Ki then 



(l-^M ff (^-l,^) + 



(l-n)pM a (K 2 + l,P) 

(1/0 - 1 + ((1 - v) + (1 - p))p)M tT (K 1 ,0) 



• if K 2 > Ki then 



{l-v)pM <J {K l -l,p) + 



(1- p) P M (T (K 2 + l,P) 
(l/j3-l + (l-p)p)M a (K 1 ,0) 

- l)[M a {K x + 1,0) + ... + M a (K 2 - 1, 0)] 
+(1/0 -l + {l-v) p)M a {K 2 ,0) 



Since 0\ < 2 and we have assumed M a (k,0i) > M a (k,0 2 ) for < K\ < 
K 2 < K — 1, in each of the cases above the right-hand side is larger when 
= (3i than when (5 = f3 2 . Because the terms in the left-hand sides are 
positive, it follows that at least one of (A), (B) must hold, as asserted. 

Step 2 We assert first that if there is a F,0 < k* < K\ such that 
M a (k*,f3\) > M a (k*, (3 2 ), then at least one of the following must hold: 

(C) there exists some K 3 , < K 3 < K x , such that M a (k,Pi) > M a (k,fi 2 ) 
for all k, < k < K 3 

(C) there exists some K 4 , < K A < K x , such that M a (k,Pt) > M a (k,f3 2 ) 
for all k, K 4 <k< K_l 

To see this, note first that if A;* = satisfies the hypothesis, then (C) holds 
with K 3 = and that if k* = K — 1 satisfies the hypothesis, then (D) holds 
with K± = K — 1. Hence it suffices to consider a k* , < k* < K — 1, that 
satisfies the hypothesis. We now make use of Step 1. Set K\ = K 2 = k* . 
Applying Step 1 once increases the token holding interval where M a (k, fii) > 
M<j(k, (3 2 ) by 1. Let K\ and K 2 be the new end points of the interval and 
apply Step 1 again. Continuing in this way we come eventually to a point 
where either K\ = or K 2 = K — 1. If K\ = 0, set K 3 = K 2 and note that 
(C) holds. If K 2 = K - 1, set K 4 = K - 1 and note that (D) holds 
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We now show that either (C) or (D) leads to the desired conclusion. 
Consider (C) first. Using the matrix representation (13) we obtain 



{l-v)pPM a {K x + l,P) + (l-u)pb 

= [l-(l-(l-p)p)P]M a (0,P) 

+ (l-P)[M (T (l,P) + ... + M ( K 1 -l,P)} 
+ [\-{\-{\-u)p)P]M a {K 1 ,P) 

The right-hand side is bigger when j3 = Pi than when P = p 2 . Therefore 
M^Kx + lifc) > M a (K! + l,l32). By induction, M {k,pi) > M a (k,p 2 ) for 
all k, < fc < K - 1. 



Now consider (D). Using the matrix representation (13) we obtain 

(1- p)pf3M a (K 2 -l,P) + (l-p)pc 

= [l-{l-(l- v ) p )j3]M a {K-l,P) 

+ (1 - (3)[M a (K -2,(3) + ... + M a (K 2 + 1,0 
+ [l-U-U-^H^M^,/?) 

The right-hand side is bigger when (3 = f3\ than when (3 = (3 2 . Therefore 
M a (K 2 - 1, /3i) > M CT (iiC 2 - 1, h). By induction, M a (k, A) > M a (k, f3 2 ) for 
all fc, < k < K - 1. 

Taking (C) and (D) together completes Step 2. 



Step 3 Using the matrix representation (13) we obtain 

[1-(1- (l-n)p)p]M ff (p,P) 
+(1 - p)[M a (l,P) + ... + M a {K x - 1,P)] 
+[l-(l-(l-v)p)P]M a {K-l,p) 
= (1 — v)pb + (1 — p)pc 

In view of Step 2, the left-hand side is bigger when j3 = f3\ than when j3 = f3 2 . 
However, the right-hand side is independent of /3, so this is a contradiction. 
We conclude that M a (k, Pi) < M a (k, p 2 ) for every k, Q < k < K - I. ■ 



Proof of Theorem 2 Fix p. The Markov strategy a is optimal if and only 
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if it satisfies the Bellman optimality conditions: 



(3(V a {k + l)-V a (k))>c,ifa{k) = l (20) 
f3(V a (k + l)-V a (k)) <c,ifa(k) = (21) 

If a is not a threshold strategy, there must exist integers K\ < K 2 such that 

a(k) = 1, < k < Ki 

a(k) = 0, K l <k<K 2 (22) 
a(k) = 1, k = K 2 

We will show that the Bellman optimality conditions are violated at K 2 
and K 2 — 1. To this end, let K3 be the smallest integer greater than K 2 
for which a(K 3 ) = 0. (Such an integer exists because it cannot be optimal 
to serve when the token holding is sufficiently high.) Thus cr(k) = 1, for 
K 2 <k<K 3 and M a (K 3 - 1) > c/fi. Following a, 

M a (K 3 - 2) = [(1 - fi)pc - 4> c M a {K z - 1)]/^ > M a (K 3 - 1) > c//3 (23) 

An inductive argument shows that M a (K 2 ) > M a (K 2 + l) > c/(3. According 
to the recursion equations ^ we have 

M a (K 2 - 1) = (<p c M a (K 2 ) + 4> r M a [K 2 + l))/(-0j) > c//3 

which is a contradiction. We conclude that a non-threshold strategy cannot 
be optimal; equivalently, only threshold strategies can be optimal strategies. 

It remains to show that the only possible optimal threshold strategies 
have adjacent thresholds. Consider first two threshold strategies with con- 
secutive thresholds K and K + 1. We assert that 

M aK (K) < c/P & M aK+1 (K) < c/P (24) 

We prove direction "=>-" ; the "<£=" direction is similar and left to the reader. 
Suppose instead that M aK+1 (K) > c/(3. It follows that —(p r M (TK+1 (K) > 
(1 — [i)pc. If we delete the last line in the matrix equation ( 13 ) for <jk+i and 
move M UK+1 (K) to the right-hand side, we get another matrix equation 

<£>KxKM aK+1 = u 
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where u = ((l-v)pb, 0, 0, -(j) r M aK+1 (K)) T . For the threshold K, $kxkM 
u. Therefore, 

$Kxk(M* k +i ~ M *k) = u - u (25) 

Lemma 1 guarantees that u— u > 0, so M ffJf+1 > M ffJf . That is, M aK+1 (k) > 
M UK {k) for < k < K-l. Because M UK+1 (K) > c/P > M aK {K), it follows 
that M aK+1 (k) > M aK (k) for < k < K. According to the matrix equation, 
the following identity holds for both a = ok and a = (Jk+V 

(1 - v)pb + (1 - n)pc 
=(l-f3+(l-p)pf3)M a (0) 

( 26 ) 

+(1 - p) J2 M a (k) + (1 - p + (1 - v)pf3)M a (K) 
k=i 

This is a contradiction so we have established the direction =^>, as desired. 
It follows directly from the matrix identity that 

M aK (K) = c/p M aK+1 (K) = c/P 

Hence 

M aK (K) > c/P & M aK+1 (K) > c/p (27) 

We now assert that if K > K then 

M aK (K) < c/P M ak (K - 1) < c/P (28) 
We have already shown that this is true when K = K + 1; i.e. M aK+1 (K) < 



c/p. Consider K = K + 2. Of M aK+2 (K + 1) > c/P, then (27) implies 
that M aic+1 (K + 1) > c//3. Therefore, M aK+1 (K + 1) > M aK+1 (K). This 
is a contradiction to M ajf+:l (if + 1) < M CTiC+1 (iT). Following inductively we 



obtain the assertion ( 28 ) 



A similar argument (which we omit) shows that: 

M CK (K - 1) > c/P M ak (K) > c/P, VK<K (29) 

Finally, suppose ax is an optimal threshold strategy. Then M UK (K — 



1) > c/P and M aic (K) < c/p. If the equalities hold strictly, (28) and (29) 
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guarantee that ok is the only optimal threshold strategy. If M CK (K — 1) = 
c/P (and hence, M aK {K) < c/j3), only gk and ok-\ are optimal threshold 
strategies. If M aK {K) = c//3 (and hence, M aK (K — 1) > c//3), only ax and 
(Tk+i are optimal threshold strategies. This completes the proof. ■ 

Proof of Theorem 3 This follows immediately from the representation of 
77+ and the definition of invariance . ■ 

Proof of Theorem 4 Given a protocol II = (a, a), let r/ n be the unique 
invariant distribution; let fi n be the fraction of agents who have no tokens 
and z/ n the fraction of agents who do not provide service; these depend only 
on II and not on the population parameters. If a = ^ i{K)ctk is a best 
response given the population parameters and /U n ,^ n , 7 must put strictly 
positive weight only on threshold strategies ok that are pure best responses. 
In view of Theorem 2, there are at most two threshold strategies that are 
pure best responses and they are at adjacent thresholds. That is, a is either 
a pure threshold strategy or a mixture of two adjacent threshold strategies, 
as asserted. ■ 

Proof of Theorem 5 Suppose to the contrary that II = (a, a) is a robust 
equilibrium protocol and that cr = Yl l(K) (J K is a proper mixed strategy, 
so that j{K) > for at least two values of the threshold K, Let /i u be the 
fraction of agents who have no tokens and z/ n the fraction of agents who 
do not provide service; these depend only on II and not on the population 
parameters. In view of Theorem 4, a must assign positive probability only 
to two adjacent threshold strategies; say a = ^{K)ok + l{K + l)ax+i with 
j(K) > and j(K + 1) > 0, and both ok,ok+\ must be best responses. 
Because (Jk(K + 1) = and uk+\{K + 1) = 1, equations (8), (9) (which 
provide necessary and sufficient conditions for optimality in terms of the 
true value function) entail that 

-c + PVk+i < PV K 
-c + PVk+i > PV K 

Hence — c + fiVx+i = {$Vk- Because &k is a best response, the value func- 
tions V aK must coincide with the true value function V . Hence, an agent 
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following ok must be indifferent to providing service when holding K to- 
kens. However, if /3 increases slightly M aK also increases, whence an agent 
following ok must strictly prefer to provide service. In other words, when (5 
increases slightly, ok can no longer be a best response and ax can no longer 
be an equilibrium protocol. This is a contradiction, so we conclude that 
a robust equilibrium protocol II cannot involve proper mixed strategies, as 
asserted. ■ 



Proof of Theorem 6 We divide the proof of (i) into several steps. 
Step 1 We first prove there exists f3 L G [0, 1) such that 

M a (K-l,P) < |for/3</3 L 
P 



M a (K-l,(3 L ) 



c 
P 



M (K-1,P) > -iorp>p 

To see this, define the auxiliary function 

F{fi) = M o {K-l,0)-^ 

F is evidently continuous. Lemma [2] guarantees that M a (K — 1, (3) is strictly 
increasing in f3, so F((3) is also strictly increasing in f5 as well. We show 
that F(l) > and lim^o F([3) < and then apply the intermediate value 
theorem to find p L . 

To see that F(l) > 0, note first that the coefficients in the left-hand 
matrix of (13) are simply 4>i = — p(l — u), (f> c = p(l — v + 1 — fj,) and 
cj) r = p(l — (j,). We split the matrix M CTif in two parts. To do this, write 



(30) 



u' = (p(l-u)c ... p(l-p)c) T 
u" = (p(l-i/)(6-c) ... 0) T 

and define M'o- x , M." aK to be the solutions to the equations 

$M' CTx = u', $M" aK = u" (31) 



Note that M aK = M' aK + and M ai{ is the solution to (13). It is easy 
to check that M' is a constant matrix: M'(k) = c for < k < K — 1. 
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Lemma [I] guarantees that the entries of M" A , are strictly positive: M"(k) > 
for < k < K — 1, Hence the entries of M CTK are strictly greater than c: 
M a (k) > c forO < k < K - 1. In particular, F(l) > 0. 



To see that lining F(/3) < 0, suppose not. Because F is strictly increas 
ing, this means F(f3) > for every /3 G (0, 1], which entails that M a {k) > | 
for < k < if — 1. Summing the rows in {1 131) yields: 



p(l - i/)6 + p(l - m)c> - /3) 



/3 



T 



(32) 



Note that Kc/ (3 flows up as /3 — )• 0, so this is impossible. We conclude that 
lim^o < 0) as asserted. 

Because F is strictly increasing, the intermediate value theorem guaran- 
tees that we can find an unique f3 L such that 



for p < f3 L 

for p > f3 L 
The definition of F yields the desired property of j3 L 



F(f3) 

F(P L ) 

HP) 



< 
= 
> 



Step 2 Next we prove there exists f3 H G (P L ,1) such that if ft G [0,/3 H ] 
then 



m* k A k 



< 



> 



-4>i P 

b c + <p r C 

~<Pl P 
b c + <p T C 

-4>i P 



for (3 < (3 H 



for p > (3 H 



To see this, note first that ^ ± P L % 

' -<pi p 

another auxiliary function: 

G(p) = M n (K-l,p)-(l 



1 



i 



+ 



i 



p(l-u) ^ p(l-v)P 



% and define 



1 



+ 



1 



G is continuous and increasing. Prom Step 1 it follows that M aK (K — 1, 1) > 
csoG(l) = M aK (K-l,l)-c> 0. It also follows that M aK (K-l,p L ) = -A; 



because (1 



we conclude that G(P L ) < 0. 
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Because G is continuous and increasing, there is a unique /3 G (f3 L , 1) such 
that 

G(fi) < for (3 < @ H 
G{I3 H ) = 

G(P) > for p > p L 

Step 3 The definitions of F, G imply that in order for II to be an equilibrium 
protocol when the discount factor is (3 it is the necessary and sufficient 
condition that F{(3) > and G(f3) < 0. Hence II is an equilibrium protocol 
when the discount factor is (3 exactly for (3 6 [(3 L ,/3 H ]. 

Because F, G are continuous in all their arguments and strictly increas- 
ing, f3 L ,(3 H , which are the zeroes of F,G, are continuous functions of the 
parameters as well. This completes the proof of (i). 

The proof of (ii) is similar and left to the reader. ■ 



Proof of Theorem 7 We first consider (i). Fix r. Consider the two proto- 
cols lift- = (K/2, ax) and Hr+i = ((K + 1) /2, uk+i) and the corresponding 
intervals [f3±,Pi] and °f discount factors that sustain equilibrium. 

We need to show that 

f3[ < $ < /3f < (3§ 

(The sustainable ranges overlap between two consecutive threshold protocols 
overlap but are not nested.) There are three inequalities to be established; 
we carry out the analyses in (A), (B), (C) below. 

There are 

(A) To prove @% > ^ , write (3 = (3f . We show that M aK+1 (K) < j. 
To see this, suppose not; i.e. M aK+1 {K) > %. The construction of (3^ 
guarantees that M aK (K— 1) = c//3. We will use this inequality and equality 
to show that all marginal payoffs of IIx+i so large that they violate the 
restrictions imposed by the bounded benefit b and cost c. 

To simplify the notation, let ux = ^^(^ — 1)^- Note ujk+i < uk- 



Then the matrix identity ( 13 ) becomes: 
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M ax (0) 
M ax (l) 

M ax {X-l) 







+ 2 -1 ••• 

-1 ojx + 2 -1 : 
-1 ojx + 2 -1 



o o -i ^ + 2j XxX L - - 

(33) 

Suppose M aK+1 (K) > M aK (K — 1) = |. We investigate the relation 
between M aK+1 (K — 1) and M UK (K — 2). Using the matrix identity, 



M aK+1 (K - 1) K+i + 2)M CTK+1 (K) 



M aK (K-2) {u K + 2)M CTK {K-l) 



* {uj K + 2)M aK (K-l) ' u K + l 



Moreover if2<k<K-l then 



M, 



(K - k) {uk+i + 1) [M aK+1 (K) + M aK+1 (K-k+ 1)] 



M aic (K-k-l) 
By induction, 



(oj k + 1) [M CTX (K - 1) + M CTK (K - fc)] - f 



M aK+1 (K-k) 
M aK (K-k-l] 



> 



> 1 



LUK+l + 1 
+ 1 



> 



> 



K + l 



> 1- 



1 



{K + l) 2 K + 2 



(K + l) 2 
,V0 < k < K- 1 



Next we prove M CTif+1 (0) > M aK (0). This is relatively easy since, if 
M aK (0) < M aK (0), then using the marginal payoff matrix and by in- 
duction, M aK+1 (K — 1) < Mfj K {K — 1) = %. This is a contradiction to 
M aK+1 (K - 1) > M aK+1 (K) = | . Therefore, M CTK+1 (0) > M tTK (0) . 

The marginal payoffs are bounded as follows, 

x-i 

(M ax (0) + M ax (X-1))+lj x J2 M *x (*0 = V/3 + c//3 (34) 

fc=0 
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However, since 

K K + l K 

k=0 k=l 

K + 1K(K + 2)K + 1 ^ 
> K {K + lYK + 2 U)K ^ M ^ k) 

K-l 
k=0 

and M aK+1 (Q) + M aK+1 (K) > M aK (0) + M UK (K - 1), a contradiction oc 

c 



curs. Therefore, for (5 = M ffK+1 (K) < §. This means $ > f${. This 
completes (A). 

(B) To prove fi^ > fi\ ■, let /3 = (3^ , we need to show that the protocol 
Hk+i must have M aK+1 (K + 1) < c/f3. We use contradiction to prove 
this. The idea is: Suppose M aK+1 (K + 1) > c//3, then we show that all 
the marginal payoffs of Hr+i are large enough such that they violate the 
restriction imposed by the bounded benefit b and cost c. 

Suppose M UK+l (K + 1) > M rTK (K) = c/(3. According to the matrix 
equation, similar to part (A), by induction we can get, 

M^K + l-k)^.^^^^ { K + lf < R 



u K+1 {K + i-k) f uK+i + i y 

M aK {K-k) \ uj K + l ) 



K(K + 2) 2 '' 

Also M aK+1 (0) > M aK (0). The marginal payoffs are bounded as follows, 

x 

{M ux (0) + M ax (X)) + 0J X Y^ M ax (k) = b/p + c/p (35) 



fc=0 



However, since 

K+i K + 2 K+1 

k=0 k=l 

K + 2K{K + 2) (K + l) 3 ^ 
> K + l (if + 1)2 K^RTW^t^ } 

K 

= u K Y J M f7K (k) 



k=0 
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and M CTK+1 (0) + M aK+1 (K + 1) > M aK (0) + M aK (K), a contradiction 
occurs. Therefore, for /3 = /3f , M ffK+1 (if + 1) < |. This means /3f > 0f . 
This completes part (B). 

(C) To prove /ftp < /3f , wreite /3 = /3f . We show that M aK+1 {K) > 
M aK (K) = f ■ If not, then as in (A) we must have M^^if) < M t7K (K) = 
in that case we show M aK+l (k) < M aK (k) for < A; < K. This will again 
violate the restrictions imposed by b and c. 

We extend the marginal payoff matrix in ( |33[ ) from if x i^T to (K + 1) x 
(K + 1) and incorporate M aK (K). If M aK {K) = %, such extension does not 
change the solution of the marginal payoffs M aK (k),\/k £ [0,K]. Note the 
new coefficient matrix has the same size of the coefficient matrix for ax+i- 
Suppose M aK+1 (K) < M aK (K) = %. According to the matrix equation, 

M UK+1 {K - 1) _ {uk+i + 2)M aK+1 (K) -c/P 
M GK {K — 1) {oo K + 2)M aK (K) - c/0 

Moreover, for < k < K we have 

M aK+1 (K-k) = (co K+1 + l)[M aK+1 {K) + M aK+1 (K - k + 1)] - c/g 
M CTX - fe) (w x + 1) \M OK (K) + M aK {K-k + l)\- c/0 

By induction, M aK+1 (k) < M aR . (k) < k < K . However, since 

x-i 

(M ax (0) + M ax (X - 1)) + cox M *x ( k ) = VP + c/P (36) 

Again, the left-hand side is bigger when X = K than when X = K + 1, 
which is a contradiction. This completes part (C). 

Combining (A), (B) and (C) establishes the desired string of inequalities. 
The remaining conclusions of (i) follow immediately. 

The argument for (ii) is very similar and left to the reader. ■ 



Proof of Theorem 8 It is convenient to first solve the following simple 
maximization problem: 

maximize E*(x\,X2) = 1 — x\ — X2 + x\X2 

0<xi,x2<i (37) 

subject to xi(l — x\) K = £2(1 — X2) K 
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To solve this problem, set f(x) = x(l — x) . A straightforward calculus 
exercise shows that if < x\ < < x^ < 1 and f(x\) = f{x2) then: 

(a) x\ + X2 > #qrp with equality achieved at x\ = X2 = ^W- 

(b) x\X2 < ^Vj, with equality achieved at x\ = X2 = ^TT- 

Putting (a) and (b) together shows that the optimal solution to the maxi- 

, 2 



mization problem (|37|) is to have x\ = X2 = and maxE* = yl 

Now fix a protocol II = (a, ax) and let r/ n be the corresponding invariant 
distribution. If we take x\ = fi u ,X2 = then our characterization of the 
invariant distribution shows that f(x\) = f{x2)- By definition, Eff(II) = 

E*{x\,x?) so 

Eff(n) < max£* = (1 



1 ^ 2 



K + l 



On the other hand, if a = K/2 then the invariant distribution has r) n (k) = 
j^ri for all k and 

Eff(K/2, o K ) = (l- j^-A = [K/(K + l)] 2 

Taken together, part (ii) and (hi) are proved.. 

Next fix a protocol (a, ok)- Let \a\ be the least integer greater than or 
equal to a and set K* = 2 \a] . There are two cases to consider. 

In the first case, K < K* . 

1 \ 2 (. 1 \ 2 A 1 



< y-jrpi) - [ 1 -wti) = y 2 W+ i 

which is the desired result in the first case. 

In the second case, K > K*. Define the protocol II' = ([a],<r^); let rj 
be the invariant token distribution for II'. Let II* = (\a~\, ax*)', note that 
the invariant token distribution rf is uniform {rj*{k) = K } +1 = 2[aT+I ^ or 
all k = 0, 1, K*). Note that II' and II have the same strategy component 
but that the token supply for II' is larger than for II, and that II' and II* 
have the same token supply but that the strategy component of II' has a 
higher threshold. 
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We assert that r/(0) > 2 [a]+r ^ no * then r ?'(^) < 2Tc^+i = if *+i ' ^ 
follows that for all k € {0, 1, K} we have r/(A;) < k*+i = r ?*(^)- Hence 



x* if* 



r«oi = E = E - ^)) + E 

fc=0 k=0 k=0 

K* K* K* K* 

<k* E(^*( fc ) - v'm + e = - e + E 

fe=0 A;=0 k=0 k=0 

K K* K K* 

=k* e i/(fc)+s*V(*)< E fcr? , (fe)+E^ , w = r«oi 



This is a contradiction. Hence, r/(0) > 2 [al+i ' 

Because the token supply for II is less than IT', the number of agents 
with no tokens is larger, so 7/(0) > r/(0) > 2 f a ]+i ■ Hence 

Eff(n) = (1 - r?(0))(l - rj(K)) < (1 - r?(0)) < f 1 - ' 



2[a] +1 



which is the desired result in the second case. This complete the proof for 
part (i). 



Proof of Theorem 9 Both assertions follow immediately by combining 
Theorems 7 and 8. ■ 

Proof of Theorem 10 We first derive the lower bound K L . If 11^- = 
(K/2, ok) is an equilibrium protocol then consecutive marginal utilities bear 
the relationship 

<f>iM aK (k - 1) + <t> c M aK {k) = -4> r M GK (k + 1) > 

(Because j3 is fixed, we suppress it in the notation.) Therefore, M aK (k) > 
=^M aK (k - 1). By induction, 
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Because (/> C M CTK (0) = (1 — v) pb — 4> r M aK {\) > (1 — v)pb + (1 — v)pc, we have 

(l-v)pb pfl b + c 



Therefore, 



Because n^is assumed to be an equilibrium protocol, we must have 
M aK (K) < c//3. Moreover, we must also have 

p/3 \ K+1 b + c c 



because otherwise M aK (K) > c//3. Therefore, 

K > maxj log pf> — 1,0} (39) 

2(l-/3)+2p0 b+ C 

This provides the lower bound K L . 

We now derive the upper bound K H . Rewriting the relation between 
consecutive marginal utilities we obtain 

= faM aK {k - 1) + cf) c M aK {k) + ^ r M aK {k + 1) 
> (j>iM aK (k - 1) + (<f> c + (f> r )M aK (k) 

Therefore, M aK (k) < -^^M UK {k- 1). By induction, 

Because cf> c M aK (0) = {l-v)pb-^ r M aK {l) < (l-u)pb-<f> r b/P = 2(l-is)pb, 
we have, 

Therefore, 



< 



M aK (k)< ( r 



p/3 \ fe+1 26 



^pW * (40) 
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Because Uk is assumed to be an equilibrium protocol, we must have 
M UK {K - 1) > c//3. Moreover, 

pj3 \ K 2b c 



> 



l-(3 + p(3j 13 ~ 13 
because otherwise M aK (K — 1) < c/(3. Therefore, 



c 



K < log^_ - (41) 

1-/3+P/3 2b 

This provides the upper bound K H . 

Combining the two estimates yields the range containing all integers K 
for which Uk is an equilibrium protocol. The estimate for efficiency follows 
immediately since Eff(U.K) > Eff{U. K h) if if > -ftT L , so the proof is com- 
plete. ■ 
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